Experimentation as a way of life: Okapi at TREC

نویسندگان

  • Stephen E. Robertson
  • Steve Walker
  • Micheline Hancock-Beaulieu
چکیده

The Okapi system has been used in a series of experiments on the TREC collections, investigating probabilistic models, relevance feedback and query expansion, and interaction issues. The TREC-6 ad hoc task was used to test an application of a new relevance weighting formula, which takes account of documents judged nonrelevant. The application was to a form of blind feedback (using the top-ranked documents from an initial search to improve the query formulation for a subsequent search, without actual relevance feedback, on the assumption that these top-ranked documents are likely to be relevant). In the routing task, the problem is one of query optimization based on a training set with known relevant documents; investigations for TREC-6 included using a form of simulated annealing for this purpose. A signi®cant feature of this work is the need to avoid over®tting of the training sample. In the interactive track, methodology remains the major problem: we do not yet know how to conduct controlled laboratory experiments which provide good information about information retrieval interaction. The Okapi team has been particularly interested in the relation between the functionalities associated with relevance feedback and the ability of searchers to make use of these functionalities. TREC provides an excellent environment and set of tools for investigating automatic systems; its value for interactive systems is not yet proven. # 1999 Elsevier Science Ltd. All rights reserved. Information Processing and Management 36 (2000) 95±108 0306-4573/99/$ see front matter # 1999 Elsevier Science Ltd. All rights reserved. PII: S0306-4573(99)00046-1 www.elsevier.com/locate/infoproman * Corresponding author. Present address: Microsoft Resarch Ltd, St George House, 1 Guildhall Street, Cambridge CB2 3NH, UK. Tel.: +44-1223-744-769; fax: +44-1223-744-777. E-mail address: [email protected] (S.E. Robertson). 1 Present address: Microsoft Research Ltd, St George House, 1 Guildhall Street, Cambridge CB2 3NH, UK. 2 Present address: University of Sheeld, UK.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

UCLA-Okapi at TREC-2: Query Expansion Experiments

This is the rst participation of the Graduate School of Library and Information Science, University of California at Los Angeles in the TREC Conference. For TREC{2, Category B, UCLA used a version of the Okapi text retrieval system that was made available to UCLA by City University, London, UK. OKAPI has been described in TREC1 (Robertson, Walker, Hancock-Beaulieu, Gull & Lau, 1993a) as well as...

متن کامل

Okapi at TREC-5

City submitted two runs each for the automatic ad hoc, very large collection track, automatic routing and Chinese track; and took part in the interactive and ltering tracks. There were no very signi cant new developments; the same Okapi-style weighting as in TREC{3 and TREC{4 was used this time round, although there were attempts, in the ad hoc and more notably in the Chinese experiments, to ex...

متن کامل

MultiText Legal Experiments at TREC 2008

Our TREC 2008 e ort used fusion IR methods identical to those used for our TREC 2007 e ort; in addition we used logistic regression to attempt to learn the optimal K value for the primary F1@K measure introduced at TREC 2008. We used the Wumpus search engine combining several methods that have proven successful, including cover density ranking and Okapi BM25 ranking, and combination methods. St...

متن کامل

TREC 14 Enterprise Track at CSIRO and ANU

By the time of submission deadline, we completed two tasks: known-item search and discussion search. For both tasks, we used the PADRE retrieval system [1], in which the Okapi BM25 relevance function was implemented. Each message in the collection was treated as an independent document, so both topic distillation scoring and same site suppression mechanism were turned off (i.e. -nocool and –SSS...

متن کامل

Interactive Okapi at Sheffield - TREC-8

The focus of the study was to examine searching behaviour in relation to the three experimental variables, i.e. searcher, system and topic characteristics. Twenty-four subjects searched the six test topics on two versions of the Okapi system, one with relevance feedback and one without. A combination of data collection methods was used including observations, verbal protocols, transaction logs,...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Inf. Process. Manage.

دوره 36  شماره 

صفحات  -

تاریخ انتشار 2000